arXiv:astro-ph/9811086vl 4 Nov 1998 


Simultaneous Multicolor Detection of Faint Galaxies in the Hubble 

Deep Field^ 


Alexander S. Szalay, Andrew J. Connolly and Gynla P. Szokoly 

Department of Physics and Astronomy, 

The Johns Hopkins University, Baltimore MD 21218 


ABSTRACT 

We present a novel way to detect objects when mnltiband images are available. 
Typically, object detection is performed in one of the available bands or on a somewhat 
arbitrarily co-added image. Onr techniqne provides an almost optimal way to nse all 
the color information available. We bnild np a composite image of the N passbands 
where each pixel valne corresponds to the probability that the given pixel is jnst sky. By 
knowing the probability distribntion of sky pixels (a distribntion with N degrees of 
freedom), the data can be nsed to derive the distribntion of pixels dominated by object 
flnx. From the two distribntions an optimal segmentation threshold can be determined. 
Clipping the probability image at this threshold yields a mask, where pixels nnlikely 
to be sky are tagged. After nsing a standard connected-pixel criterion, the regions of 
this mask dehne the detected objects. Applying this techniqne to the Hnbble Deep Field 
data, we hnd that we can extend the detection limit of the data below that possible nsing 
linearly co-added images. We also discnss possible ways of enhancing object detection 
probabilities for certain well dehned classes of objects by nsing varions optimized linear 
combinations of the pixel fluxes (optimal subspace hltering). 

Subject headings: object detection, multicolors, faint galaxies 


^ Based on observations made with the NASA/ESA Hubble Space Telescope, obtained from the data archive at the Space Telescope 
Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., under cooperative agreement 
with the National Science Foundation 
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1. Introduction 


Object detection, especially at faint flux limits, has 
always been considered something of a “black art”. 
This is despite the fact that all techniques use the 
same basic steps, i.e. filtering of the images with a 
PSF-like kernel, clipping of the filtered data using a 
heuristically chosen threshold and then rejecting ob¬ 
jects with too few connected pixels (FOCAS — Valdes 
1982, Sextractor — Bertin & Arnouts 1996). Each 
of these methods works well in the high signal-to- 
noise regime, easily detecting all objects that lie sig¬ 
nificantly above the sky noise. There is, however, a 
substantial difference in the efficiency of the object 
detection process, dependent on how we filter the im¬ 
ages and what thresholds are applied to identify ob¬ 
jects as the detection threshold approaches the noise 
level. 

When multiband photometry is available, the prob¬ 
lem of how to systematically detect objects in all pass- 
bands becomes significantly more complex. There are 
two basic approaches that are commonly used in the 
astronomical literature. Object detection is under¬ 
taken in a single passband (or a co-added image) with 
a matched aperture used to measure the fluxes of the 
object in the additional passbands or object detec¬ 
tion is applied to each passband independently and 
the resulting catalogs are merged to give a master list. 
Each of these approaches has its own set of problems. 
Using a single image to derive a catalog can miss ob¬ 
jects with unusual colors or objects at the limit of 
detection in a single passband (e.g. it is easy to imag¬ 
ine that if we set an object detection limit at 5a for 
each pixel in a single band we will miss objects which 
would be a 4.her detection in each individual pass- 
band but if we had considered all bands together we 
would have accepted this pixel with a very high sig¬ 
nificance) . Detecting independently in each passband 
requires matching of objects that may be detected 
in only one passband or be resolved into a different 
number of components dependent on the passband 
the object was observed in. 

In this paper we attempt to deal with these de¬ 
tection problems in a simple and objective fashion 
that segments images by taking into account all the 
color information available. In Section 2 we describe 
the statistical basis and the formal description of our 
technique. Sections 3 contains the description of our 
technique as applied to the Hubble Deep Field, the 
ultimate faint multicolor dataset that has been a chal¬ 


lenge for advanced processing techniques. In Section 4 
we discuss the results, and the use of optimal subspace 
filtering to enhance the detection of fainter objects of 
a specific color. 

2. Multicolor Segmentation of Object Pixel 
from Sky Noise 

2.1. Transformation of the data into a prob¬ 
ability image 

Consider a region on the sky that has been imaged 
in N different photometric bands all of which have 
been registered to a common scale and size, thus every 
pixel contains N numbers, the measured fluxes fi,i = 
1..N. We will further assume that the images are sky 
noise limited, i.e. instrumental sources of pixel noise 
are irrelevant compared to the photon noise of the 
sky background. In order to proceed further we first 
determine the mean fXi and the Gaussian dispersion ai 
of the sky in each band. This can be performed using 
standard techniques, e.g. by studying the distribution 
of pixels in the vicinity of the mode of the pixel values. 
In the next step each image is transformed into one 
where the sky is a normal Gaussian with zero mean 
and unit dispersion. Each pixel is thus described by 
the A^-dimensional vector 

9i = — —(I) 

CTi 

If there were no objects in the image, only sky, each 
component of these vectors would be distributed as a 
normal Gaussian. The probability distribution of y, 
the squared length of the vector, 

N 

y = ^9l ( 2 ) 

i=l 

is then a with N degrees of freedom and each value 
of y is uniquely associated with the probability den¬ 
sity, 

dP{y) = e-y/^y-/--^dy. (3) 

Since this is a monotonic function with respect to 
y, we can create a new image consisting of the 
values. Any value of a pixel in this image can be 
immediately interpreted as the cumulative probabil¬ 
ity P{> y) of how likely this pixel is drawn from 
the sky distribution. This probability can be com¬ 
puted or read from a standard table. In the actual 


2 



“probability”-image we will just use the y values for 
convenience since the mapping is monotonic. 

If we have performed filtering on the original im¬ 
ages (e.g. with a PSF-like kernel), the pixel values 
may be correlated as Wg{9), even if the original sky 
pixels were not. It was shown (Adler 1981), that in 
the special case of 2D random fields the field has 
a correlation function Wy{9) = 2Nwg{9)‘^, thus in the 
filtered and non-filtered images we have an analytic 
understanding of how the statistical properties behave 
under our null hypothesis (i.e. a purely sky noise im¬ 
age). 

2.2. Segmentation of the Probability Image, 
and the Optimal Choice of Threshold 

The next step is to consider the deviations from 
our null-hypothesis, i.e. that all pixels are drawn 
from a Gaussian sky. This will be done by clipping 
the x^-image at a threshold high enough that a pixel 
is unlikely to be drawn from the sky at a particular 
significance level. Figure 1 shows the distribution of 
the R = ^ values derived from the Hubble Deep 
Field (see Section 3). The long-tailed contribution 
due to the presence of objects within the images is 
clearly visible. In principle we could choose an ar¬ 
bitrary threshold in the y values to separate those 
pixels dominated by object flux from the sky pixels. 
As we know the expected probability distribution for 
the sky pixels (a x^ distribution) we can approach 
this problem in an objective fashion. 

For small values of y we know that the contribu¬ 
tion to the flux in each pixel will be due to the sky 
and that the distribution will be that of a x^- In 
Figure 1 the solid line shows the fitted x^ distribu¬ 
tion and the dashed line the difference between the 
X^ and the observed distribution (i.e. the contribu¬ 
tion due to the objects). The point where the two 
lines intersect determines the so called “optimal Bayes 
threshold” (Fukunaga 1990), resulting in the smallest 
possible total error, i.e. the sum of erroneously tag¬ 
ging a sky pixel as an object, or missing an object 
pixel. If a different ’cost’ for the two kinds of errors 
is adopted, making one more expensive, the optimal 
threshold can be computed from minimizing that par¬ 
ticular cost function. For example, we may want to 
set our threshold much lower, at the expense of many 
false detections, in order to find all the objects in the 
field. 

Clipping at this threshold results in a binary mask 


image, where every pixel is tagged with a value of 0, 
if it is consistent with sky, and 1 if it is unlikely to 
be sky. The next step in the processing is the cre¬ 
ation of contiguous regions. Objects which consist of 
too small a number of pixels need to be rejected. In 
principle such a distribution can be analytically cal¬ 
culated, at least in 2 dimensions, in the high threshold 
approximation (Adler 1978), but we propose an em¬ 
pirical approach: it is easy to generate entirely Gaus¬ 
sian random images, then apply the same filtering, 
thresholding and segmentation as for the data above. 
The region statistics computed from such images will 
then yield the distribution for our null hypothesis. 
Subtracting this from the data will give us the ex¬ 
pected distribution of the object sizes. An optimal 
threshold is then again determined from these two 
distributions, as above. 

3. Faint Object Detection 

In the following section we describe the applica¬ 
tion of these techniques to the multicolor U^qq, i? 45 o, 
Veoe and / 8 i 4 photometry of the Hubble Deep Field 
(HDF; Williams et al. 1996). To compare the x^ im¬ 
age detection algorithm with the standard methods 
we restrict our analysis to a single WFPG2 chip (chip 
4) taken from Version 2 of the drizzled HDF data. 
We further restrict the data we consider by excluding 
the outer 200 pixels from each image to remove those 
regions of each frame where the noise is considerably 
higher, due to the dithering pattern. The final frames 
comprise, for each of the four passbands, 1781x1741 
images with 0.04 arcsec pixels. 

In the construction of these images they have al¬ 
ready undergone a fairly detailed reduction process, 
which includes compensation for bad pixels and cos¬ 
mic ray removal. They still require, however, addi¬ 
tional processing in order to derive a pixel set with 
more uniform statistical properties. 

3.1. Preprocessing of the images 

3.1.1. Flattening the image 

If the mean sky background differs from zero across 
the frame (i.e. the sky has not been accurately sub¬ 
tracted from the images) the y image will not ap¬ 
proximate a x^ distribution, even without the pres¬ 
ence of objects on a frame. In fact how well the y 
image matches a x^ distribution, for low values of y, 
is a very sensitive test of the background subtraction 
technique. We know from examining the HDF data 
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that they suffer from residual scattered light, partic¬ 
ularly in the C /300 passband (Williams et al. 1996). 
This arises as an X pattern, due to the shadowing of 
the scattered light by the mirror supports, superim¬ 
posed on the background sky and occurs at the few 
percent level. We must, therefore, fit and remove this 
contribution before constructing the y image. 

As this is an additive contribution, it can be sub¬ 
tracted - its effect on the level of sky noise is minimal. 
To create a smooth sky we derive a mask, correspond¬ 
ing to a very generous cut of non-sky pixels. We set 
our threshold to select the upper 16% of pixels in 
a variance image measured over a 3x3 pixel neigh¬ 
borhood. To safely stay away from the wings of ob¬ 
jects we grow this mask outwards by 5 pixels. These 
masked pixels defined the object regions. Next, we 
grow these regions by an additional 14 pixels. This 
provides an annulus of sky around each object. Pixels 
within neighboring object regions that intersect these 
annuli have been masked off. The mean value of the 
sky has been computed within these annuli and the 
central regions corresponding to objects filled-in with 
this value. Finally, we smooth this background image 
image with a 46 pixel wide Gaussian filter and sub¬ 
tract it from the original frame (creating images with 
zero mean sky). 

3.1.2. Normalizing to unit dispersion 

The final step in preprocessing the images is to con¬ 
volve with a 3 pixel wide symmetric Gaussian kernel 
to perform the usual PSF filtering. This step removes 
the arbitrariness of the location of pixel boundaries, 
and increases the S/N of the data. The HDF im¬ 
ages have also a rather peculiar correlation due to the 
pattern of the drizzling. By choosing a filter slightly 
broader than this width, we are insensitive to the de¬ 
tails of the drizzling. 

We then build a histogram of the pixel values in 
each of the bands and measure the dispersion of the 
sky. This is done by fitting an inverted parabola to the 
log of the pixel histogram, using only the data range 
around the modal value, extending slightly more to 
the negative. The fits were found to be excellent. The 
images were then divided by the dispersion, resulting 
in zero mean, unit dispersion Gaussian images. 

3.2. Building the image 

To build up the V image we sum the squares 
of the individual pixel values in the four normalized 


images. From this we construct the probability his¬ 
togram (see Figure 1). For the HDF data we choose 
to consider the distribution of the square root, R of 
the y values (i.e. R = ^). We do this because R 
corresponds to a signal-to-noise statistic (as opposed 
to the squared signal-to-noise of y) and can, there¬ 
fore, be more naturally compared with the standard 
techniques of thresholding images at a given multiple 
of the sky noise. R is the length of the A^-dimensional 
pixel vector. The normalized probability distribution 
for R, with 4 degrees of freedom, corresponds to 

dP{R) = ^ 4 ) 

In Figure 1 we show the distribution of pixel val¬ 
ues. As noted in Section 2.2 the long positive tail is 
due to the contribution from objects within the im¬ 
age frame. We fit the probability distribution given 
in Equation 4 to the points below the peak of the his¬ 
togram (i.e. i? < 2 ) by varying the number of sky 
pixels as the only free parameter. The fit is shown 
as a dotted line within Figure 1. The difference be¬ 
tween the fitted “sky” distribution and the observed 
histogram is given by the dashed line. This repre¬ 
sents the probability distribution of pixels occupied 
by object flux. The intersection of these two lines 
provides the Bayesian threshold (where the incom¬ 
pleteness and contamination errors, at a pixel level, 
are equal). We choose this crossover point (i? = 3.73) 
as the threshold used in the image detection and anal¬ 
ysis routines, in order to demonstrate our ideas. This 
is a rather conservative cut, as we shall see, since this 
corresponds to a 2.42 (t per pixel pixel detection. 

3.3. Object detection 

Object detection is undertaken using Sextractor 
v2.0.8 (Bertin & Arnouts 1996). This version of Sex- 
tractor allows the detection and analysis algorithms 
to be run on separate images (i.e. we can detect ob¬ 
jects on one frame and measure the photometric prop¬ 
erties of these detected objects from a separate im¬ 
age) . We, therefore, use the R image for detection and 
measure the fluxes and magnitudes for a fixed set of 
apertures in the [/ 300 , B 450 , Veoe and / 8 i 4 passbands. 
As all detections are undertaken using a single image 
this procedure alleviates the task of matching catalogs 
from multiple passbands (where images from different 
filters may be undetected in a particular passband or 
broken up into multiple images). 

The intersection of the sky and object distributions 
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corresponds to an R value of i? = 3.73. This is equiv¬ 
alent to a probability of 0.9924 that a pixel value in 
the convolved image is not due to the background sky. 
We adopt this relatively high probability because the 
detected objects are unlikely to be contaminated by 
spurious sources and, as the goal of this paper, is to 
demonstrate the applicability of our new technique 
and to compare it with standard techniques, we do 
not wish to push the initial application into the noise. 

It should be noted that Sextractor, as with most 
object detection packages, fits and subtracts a sky 
image before applying a detection threshold. As the 
image is, by definition, a positive definite quan¬ 
tity, subtracting a background will affect the detec¬ 
tion threshold. We have, therefore, modified the code 
to exclude background subtraction for the detection 
image. To derive fluxes in the photometric passbands 
for each detected object, matched aperture photome¬ 
try was then performed on the C/300, /?450, Vooe and 
/814 images separately. 

Of course, when searching for faint objects, one 
may abandon the choice of the optimal Bayes clas¬ 
sifier, rather one would set a much lower detection 
threshold. Such a threshold will shift the balance 
between the two errors, in essence we detect all the 
real objects, at the expense of a lot of false detec¬ 
tions. In such a case one can use some more heuristic 
constraints to reduce the number of these spurious 
sources. For example, one can require that in the de¬ 
tected objects all of the fluxes be non-negative, or dif¬ 
fer from zero by less than Icr, or one can require that 
at least one of the bands has a positive flux. Since 
the sky noise has a symmetry around zero in each of 
the bands, one can subdivide our detections into 2^ 
‘hyper-quadrants’, according to the sign of their fluxes 
in each of the N bands. In the all-negative quadrant, 
all objects are false detections. Since there should be 
a similar number in the other ‘hyper-quadrants’, we 
can use the all-negatives as an estimator of the false 
detections. In practice, at the limit of the R = 3.73 
threshold, this filtering of the catalog makes little dif¬ 
ference to the number of detected sources. 

4. Discussion 

4.1. Comparison of the detection to stan¬ 
dard techniques 

As we have shown in Section 2 the detection 
algorithm provides an almost optimal use of the mul¬ 
ticolor information. We now compare the results from 


the HDF data with the standard detection techniques 
found in the literature. Specifically we consider three 
cases, one where the initial detection is undertaken on 
the I band image alone (using matched apertures on 
the other 3 passbands to derive the multicolor pho¬ 
tometry), second where we detect galaxies using the 
sum of the V and I images (as was done for the HDF 
data in Williams et al. (1996)) and a third case where 
we co-add all four passbands weighted by the inverse 
of their sky noise (the weighting of the passbands is 
given by the vector in Equation 5). 

To provide a direct comparison between these dif¬ 
fering techniques we need to detect galaxies to a com¬ 
mon threshold. The R = 3.73 threshold corresponds 
to a probability of 0.9924 that the flux in a pixel in 
the convolved image exceeds that due to the sky alone 
(i.e. contains object flux). For a Gaussian sky distri¬ 
bution with zero mean this is equivalent to a 2.43 (t 
deviation. We, therefore, convolve the I and V + I 
images with a kernel of 3 pixels FWHM (as was done 
for the original images used to create the y^) and use 
Sextractor to derive an object catalog with a 2.43(J 
threshold. Note that this deliberately conservative 
threshold is substantially higher than that applied to 
the HDF data by Williams et al. (1996). 

Figure 2 shows the comparison between these dif¬ 
ferent approaches (for each of the four passbands), 
with the y^ technique shown as a long dashed line, 
the weighted image {PCI) as a short dashed line, the 
V + I detection as a solid line and the I detection 
as a long then short-dashed line. For bright galaxies 
(/ < 27.5) each of the three techniques find similar 
numbers of objects on a frame. As we move to fainter 
magnitude limits the numbers of objects detected in 
the PCI, V + I and y^ images clearly exceed those 
found in the I band image. The magnitude limit for 
the detection of objects in the PCI and V + I im¬ 
ages are approximately 0.4 magnitudes fainter than 
that determined from the I image. The comparable 
depths of the PCI and V + I catalogs is not surpris¬ 
ing. The difference between the weight vector for the 
PCI image and that for the V+I image is only 17 de¬ 
grees. Therefore, the V + I image provides essentially 
an optimal linear combination of the four passbands 
(given the relative depths of the data). It differs from 
the PCI image only in that it is not sensitive to very 
blue objects. The y^ detection limit extends, how¬ 
ever, 0.4 magnitudes deeper than either of the PCI 
and V + I image. 

The difference between the different detection tech- 
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niques on the histogram is striking. Since the thresh¬ 
olds were locked together through the pixel tail prob¬ 
abilities, one obvious reason could be that simply tak¬ 
ing a lower threshold in the Gaussian detections would 
have reproduced the result of the clipping. There 
will be obviously some differences, due to the different 
behavior of the excursion sets (regions consisting of 
pixels above a threshold) for Gaussian and x^ fields. 
Imagine an object in a single band image consisting of 
m pixels, all exactly at our selection threshold. The 
sky noise will statistically move half the pixels below 
the detection threshold, while the other half will re¬ 
main above. For a x^: in th® high threshold limit, 
approximately the same thing happens, but as we ap¬ 
proach lower and lower fluxes, it is more and more 
likely that the effect of sky noise will be such, that 
the pixel remains above the threshold, thus the de¬ 
structive interference is much smaller, at the expense 
of more false detections, of course. 

Last, but not least it is also possible, that we sim¬ 
ply detect objects with such colors, that were un¬ 
detected in the single (or co-added) Gaussian band. 
This can be seen in Figure 3 where we show the dis¬ 
tribution of {B — V) colors of objects found in the 
PCI and x^ images. The solid histogram shows the 
color distribution for objects selected from the PCI 
images and the open histogram the color distribution 
from the x^ image. The weighting of the PCI se¬ 
lection preferentially selects objects with intrinsically 
redder colors (with a mean color of B — V = 0.46). 
The x^ image, by contrast, has a more symmetric 
color distribution with a mean color B — V = 0.37. 
Also note, that the scatter in the color distribution is 
considerably broader for the x^ detections. 

We show that this is a significant effect, by splitting 
the objects into two categories: those detected in both 
the PCI and R images and those detected in R only 
(there were no objects found in PCI but not in R). 
Next, we create a histogram of the number of these 
detections versus the peak of the R value inside the 
object. In Figure 4 we show this distribution with 
those objects found in the PCI and R images shown 
by a solid line and those found in R alone shown by 
a dashed line. We can see, that there are objects, un¬ 
detected in PCI, that have a very significant value of 
R. 

In Figure 5 we show the B, V and I images of 4 of 
those objects which had significant R values {R > 10) 
but were not detected in the PCI image. Together 
with the optical passbands we show the R image for 


each object. It is clear from these images that those 
objects selected in the R frame but not in PCI are 
predominantly bright in a single passband (possibly 
due to strong emission lines). A linear combination of 
the available passbands (whether the V +1 combina¬ 
tion or the PCI weighted sum) does not maximize the 
signal-to-noise of the resultant image for these galax¬ 
ies with unusual colors. The R image does, however, 
provide an optimal combination of the available pass- 
bands and can be used to derive a sample of galaxies 
that is not biased towards a specific type while maxi¬ 
mizing the information present in the multicolor data. 

4.2. Optimal Detection of Objects of Specific 
Color 

The x^ technique we have described above is the 
optimal way for detecting objects within an image 
with no prior information. In this section we outline 
a possible extension to our technique given a model 
for the colors of the objects we expect to detect. 

In our four-dimensional flux system (for the HDF) 
every pixel is a 4D vector. Sky pixels are uniformly 
scattered, all directions are equally probable. Ob¬ 
jects or pixels of a specific color on the other hand 
all point to the same direction, denoted by ft. Since 
the noise with our scaling is spherically symmetric, 
the component of the noise in this direction is also 
a normal Gaussian. Thus by ignoring all the other 
components, i.e. taking a linear combination of the 
different images along the ft direction, we discard 3 of 
the 4 noise components, an effective increase in our 
signal-to-noise of 2. Thus we are keeping as much 
of the signal as possible, while including as little of 
the noise as possible. Similarly, we can define a 2 di¬ 
mensional subset, where most of the objects lie, and 
we can create a x^ image in that 2D subset of the 
4D color space. These techniques are called, for the 
obvious reason, “Optimal Subspace Filtering”. In or¬ 
der to determine the optimal directions, we have per¬ 
formed a Principal Component Analysis for all the 
pixels which were above our threshold of 3.73 ct. The 
resulting vector is the following: 

n= (0.075,0.29,0.66,0.69) (5) 

Note, that this is quite near to the V + I, the cosine 
of the angle is 0.95. This would be the preferred di¬ 
rection to use to go after objects of ‘typical’ colors, 
but it would have the danger of losing faint objects 
at the extreme ends of the color distribution. 
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If we know colors of our preferred objects a pri¬ 
ori, i.e. we are focusing on a narrow range of galaxy 
colors, we can use the linear combination along the 
corresponding direction to enhance our detection. In 
such a way one can develop particularly sensitive de¬ 
tections of specihc classes of object. 

4.3. Conclusions 

We have formulated an object detection algorithm 
that is capable of using all the multicolor information 
available in a given field. Applying this algorithm to 
the Hubble Deep Field we have shown that it per¬ 
forms substantially better than standard image anal¬ 
ysis techniques. Given a certain probability thresh¬ 
old, we find that the y^-based detection technique can 
identify faint objects of unusual colors, where detec¬ 
tions on single co-added images may fail. We outline 
several heuristics on how false detections can be re¬ 
duced, should one adopt a very low detection thresh¬ 
old, using the isotropy properties of the multidimen¬ 
sional sky-noise and we discuss how to optimize these 
detections for objects of specihc colors. Given the in¬ 
creasing commonality of large multicolor photomet¬ 
ric surveys we believe that the adoption of techniques 
such as we describe here would signihcantly improve 
the resulting object catalogs. 

We would like to thank the HDF team for making 
their data so rapidly available to the public. Most 
of our image processing has been implemented using 
KBVision, helpful advice from Pete Eggleston (AAI) 
is happily acknowledged in this task. 
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Eig. 1.— The distribution of pixels in the i?-image 
is shown for chip 4 of the HDE data set. The solid 
line shows expected distribution for a with 4 de¬ 
grees of freedom. The longer positive tail is due to the 
contribution of pixels containing object flux. When 
we subtract the expected distribution for the sky we 
derive the dashed line (showing the distribution of 
object pixels). Where these two distributions cross is 
the Bayes threshold where the error due to misclas- 
sifying object pixels as sky and sky pixels as objects 
is equal. For chip 4 this threshold is i? = 3.73 corre¬ 
sponding to a probability of 0.9924 that a pixel is not 
drawn from the sky distribution. 


Fig. 2.— The number counts derived from Ghip 4 
of the HDF data sets for the U 300 , B 450 , keoe and 
/814 passbands. The long-dashed line shows the num¬ 
ber counts derived using the y^ image for detection, 
the short-dashed line the number counts from the 
weighted sum of all passbands (PCI), the solid line 
the counts from the V +1 images and the long-short 
dashed line from the I image. Each detection tech¬ 
nique has been applied at the same probability limit 
(0.9924 that the pixel is not drawn from the sky). In 
the bright limit / < 27.5 all detection techniques find 
similar numbers of objects. At fainter magnitude lim¬ 
its the PCI and V + I detection exceed that of the 
I image by approximately 0.4 magnitudes and the y^ 
image exceeds these by a further 0.4 magnitudes. 


Fig. 3.— The color distribution P 450 — Veoe of objects 
detected in the HDF images. The filled histogram is 
derived from the objects detected from the PCI im¬ 
age and the unfilled histogram from the y^ image. 
The PCI image detection results in a color distribu¬ 
tion that is skewed to the red in P 450 —Heoe color. The 
y^ image provides a more Gaussian distribution of ob¬ 
jects centered on a mean color of P 450 — Veoe = 0.37. 


Fig. 4.— The solid line indicates the distribution of 
objects detected in the PCI image, as a function of 
the peak R value within the object, while the dashed 
line corresponds to objects detected in the y^ image, 
and undetected in the PCI. Note, that there are 
several objects at rather high R values. 
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Fig. 5.— Cutouts of a few objects with odd colors, 
which were detected in the image, but remained 
undetected in PCI. The leftmost bitmap shows the 
R image (the distance from the origin in multicolor 
S/N space), while the next three show the flattened 
B, V and / images. Each detected object appears to 
be bright in a single passband (possibly due to strong 
emission lines) and weak in the other two. All of 
these bands contribute to the signal in the R image. 
Note, how obvious the objects look in the R image, 
compared to the other three. The 4 objects shown 
can be identified in the original drizzled WFPC2 chip 
4 images at coordinates (a) x = 1958 y = 1016, (b) 
x = M2y= 1649 (c) a; = 1891 y = 1062 (d) a; = 1413 
y = 1586. 
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